Overview
Brought to you by YData
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 532825 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 56.9 MiB |
| Average record size in memory | 112.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 3 |
accnt_bgn_date is highly overall correlated with accnt_bgn_year | High correlation |
accnt_bgn_year is highly overall correlated with accnt_bgn_date | High correlation |
cprtn_prd_d is highly overall correlated with erly_pnsn_flg | High correlation |
erly_pnsn_flg is highly overall correlated with cprtn_prd_d and 2 other fields | High correlation |
gndr is highly overall correlated with pnsn_age | High correlation |
pnsn_age is highly overall correlated with erly_pnsn_flg and 1 other fields | High correlation |
prsnt_age is highly overall correlated with erly_pnsn_flg | High correlation |
erly_pnsn_flg is highly imbalanced (77.5%) | Imbalance |
Reproduction
| Analysis started | 2024-10-27 06:21:14.563044 |
|---|---|
| Analysis finished | 2024-10-27 06:22:01.579988 |
| Duration | 47.02 seconds |
| Software version | ydata-profiling vv4.11.0 |
| Download configuration | config.json |
Variables
location
Real number (ℝ)
| Distinct | 164734 |
|---|---|
| Distinct (%) | 30.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.036482345 |
| Minimum | 7.2878831 × 10-5 |
|---|---|
| Maximum | 0.93116904 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 7.2878831 × 10-5 |
|---|---|
| 5-th percentile | 0.0009324753 |
| Q1 | 0.010909121 |
| median | 0.026499609 |
| Q3 | 0.041558501 |
| 95-th percentile | 0.10502271 |
| Maximum | 0.93116904 |
| Range | 0.93109616 |
| Interquartile range (IQR) | 0.030649379 |
Descriptive statistics
| Standard deviation | 0.049598478 |
|---|---|
| Coefficient of variation (CV) | 1.35952 |
| Kurtosis | 46.619598 |
| Mean | 0.036482345 |
| Median Absolute Deviation (MAD) | 0.015355228 |
| Skewness | 5.4828531 |
| Sum | 19438.706 |
| Variance | 0.002460009 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.03636653686 | 36204 | 6.8% |
| 0.01818326843 | 18178 | 3.4% |
| 0.01212217895 | 12620 | 2.4% |
| 0.009091634214 | 9648 | 1.8% |
| 0.007273307371 | 7786 | 1.5% |
| 0.006061089476 | 6484 | 1.2% |
| 0.005195219551 | 5443 | 1.0% |
| 0.004545817107 | 4680 | 0.9% |
| 0.004040726317 | 4096 | 0.8% |
| 0.003636653686 | 3619 | 0.7% |
| Other values (164724) | 424067 |
| Value | Count | Frequency (%) |
| 7.287883137 × 10-5 | 1 | |
| 7.302517441 × 10-5 | 1 | |
| 7.317210635 × 10-5 | 1 | |
| 7.331963076 × 10-5 | 1 | |
| 7.346775122 × 10-5 | 1 | |
| 7.361647137 × 10-5 | 1 | |
| 7.376579484 × 10-5 | 1 | |
| 7.391572532 × 10-5 | 1 | |
| 7.406626651 × 10-5 | 1 | |
| 7.421742215 × 10-5 | 2 |
| Value | Count | Frequency (%) |
| 0.9311690383 | 1 | < 0.1% |
| 0.925874349 | 1 | < 0.1% |
| 0.9196972114 | 1 | < 0.1% |
| 0.9123969579 | 1 | < 0.1% |
| 0.9036366537 | 1 | < 0.1% |
| 0.8929296152 | 2 | < 0.1% |
| 0.8795458171 | 2 | < 0.1% |
| 0.8623380767 | 2 | < 0.1% |
| 0.8393944228 | 4 | < 0.1% |
| 0.8072733074 | 10 |
addrss_type
Real number (ℝ)
| Distinct | 532802 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.036614107 |
| Minimum | 0.00079057689 |
|---|---|
| Maximum | 0.25909163 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0.00079057689 |
|---|---|
| 5-th percentile | 0.030423404 |
| Q1 | 0.030540759 |
| median | 0.030608328 |
| Q3 | 0.03105949 |
| 95-th percentile | 0.064708336 |
| Maximum | 0.25909163 |
| Range | 0.25830106 |
| Interquartile range (IQR) | 0.00051873071 |
Descriptive statistics
| Standard deviation | 0.013085451 |
|---|---|
| Coefficient of variation (CV) | 0.35738823 |
| Kurtosis | 1.7736994 |
| Mean | 0.036614107 |
| Median Absolute Deviation (MAD) | 0.00012706598 |
| Skewness | 1.7616708 |
| Sum | 19508.912 |
| Variance | 0.00017122902 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.03636653686 | 4 | < 0.1% |
| 0.01212217895 | 4 | < 0.1% |
| 0.01818326843 | 4 | < 0.1% |
| 0.003636653686 | 2 | < 0.1% |
| 0.1272729086 | 2 | < 0.1% |
| 0.1131314743 | 2 | < 0.1% |
| 0.002797425912 | 2 | < 0.1% |
| 0.003030544738 | 2 | < 0.1% |
| 0.003306048805 | 2 | < 0.1% |
| 0.1197862669 | 2 | < 0.1% |
| Other values (532792) | 532799 |
| Value | Count | Frequency (%) |
| 0.0007905768882 | 1 | |
| 0.0008081452635 | 1 | |
| 0.0008265122013 | 1 | |
| 0.0008457334152 | 1 | |
| 0.0008658699251 | 1 | |
| 0.0008869887038 | 1 | |
| 0.0009091634214 | 1 | |
| 0.000932475304 | 1 | |
| 0.0009570141278 | 1 | |
| 0.0009828793745 | 1 |
| Value | Count | Frequency (%) |
| 0.2590916342 | 1 | |
| 0.2072733074 | 1 | |
| 0.2036366537 | 1 | |
| 0.1851242306 | 1 | |
| 0.1727277561 | 1 | |
| 0.1696972114 | 1 | |
| 0.1614546615 | 1 | |
| 0.1598087651 | 1 | |
| 0.1566435798 | 1 | |
| 0.1552448668 | 1 |
prvs_npf
Real number (ℝ)
| Distinct | 522060 |
|---|---|
| Distinct (%) | 98.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.036455654 |
| Minimum | 0.00018842765 |
|---|---|
| Maximum | 0.7818185 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0.00018842765 |
|---|---|
| 5-th percentile | 0.025398991 |
| Q1 | 0.025536612 |
| median | 0.025699073 |
| Q3 | 0.027377749 |
| 95-th percentile | 0.064632776 |
| Maximum | 0.7818185 |
| Range | 0.78163008 |
| Interquartile range (IQR) | 0.0018411373 |
Descriptive statistics
| Standard deviation | 0.038510279 |
|---|---|
| Coefficient of variation (CV) | 1.0563596 |
| Kurtosis | 53.060861 |
| Mean | 0.036455654 |
| Median Absolute Deviation (MAD) | 0.00027316601 |
| Skewness | 6.6112815 |
| Sum | 19424.484 |
| Variance | 0.0014830416 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.03636653686 | 184 | < 0.1% |
| 0.01818326843 | 140 | < 0.1% |
| 0.01212217895 | 118 | < 0.1% |
| 0.009091634214 | 99 | < 0.1% |
| 0.007273307371 | 89 | < 0.1% |
| 0.006061089476 | 83 | < 0.1% |
| 0.005195219551 | 72 | < 0.1% |
| 0.004545817107 | 64 | < 0.1% |
| 0.004040726317 | 58 | < 0.1% |
| 0.003636653686 | 54 | < 0.1% |
| Other values (522050) | 531864 |
| Value | Count | Frequency (%) |
| 0.0001884276521 | 1 | |
| 0.0001894090461 | 1 | |
| 0.0001904007165 | 1 | |
| 0.0001914028256 | 2 | |
| 0.0001924155389 | 2 | |
| 0.0001934390258 | 2 | |
| 0.0001944734591 | 2 | |
| 0.0001955190154 | 2 | |
| 0.0001965758749 | 2 | |
| 0.000197644222 | 2 |
| Value | Count | Frequency (%) |
| 0.7818185041 | 1 | < 0.1% |
| 0.7590916342 | 2 | < 0.1% |
| 0.7545458171 | 1 | < 0.1% |
| 0.7530305447 | 1 | < 0.1% |
| 0.7305787761 | 1 | < 0.1% |
| 0.7194809338 | 1 | < 0.1% |
| 0.7036366537 | 1 | < 0.1% |
| 0.6951051182 | 1 | < 0.1% |
| 0.6897729086 | 1 | < 0.1% |
| 0.6787888456 | 6 |
brth_plc
Real number (ℝ)
| Distinct | 23020 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.03738081 |
| Minimum | 5.3245296 × 10-5 |
|---|---|
| Maximum | 0.93116904 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 5.3245296 × 10-5 |
|---|---|
| 5-th percentile | 0.0020203632 |
| Q1 | 0.018183268 |
| median | 0.036366537 |
| Q3 | 0.036366537 |
| 95-th percentile | 0.083499925 |
| Maximum | 0.93116904 |
| Range | 0.93111579 |
| Interquartile range (IQR) | 0.018183268 |
Descriptive statistics
| Standard deviation | 0.048824737 |
|---|---|
| Coefficient of variation (CV) | 1.3061444 |
| Kurtosis | 61.450947 |
| Mean | 0.03738081 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.8018222 |
| Sum | 19917.43 |
| Variance | 0.0023838549 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.03636653686 | 309811 | |
| 0.01818326843 | 37382 | 7.0% |
| 0.01212217895 | 17188 | 3.2% |
| 0.009091634214 | 10763 | 2.0% |
| 0.007273307371 | 7550 | 1.4% |
| 0.006061089476 | 5772 | 1.1% |
| 0.005195219551 | 4650 | 0.9% |
| 0.004545817107 | 3854 | 0.7% |
| 0.004040726317 | 3281 | 0.6% |
| 0.003636653686 | 2825 | 0.5% |
| Other values (23010) | 129749 |
| Value | Count | Frequency (%) |
| 5.324529554 × 10-5 | 1 | |
| 5.332336782 × 10-5 | 1 | |
| 5.340166939 × 10-5 | 1 | |
| 5.348020126 × 10-5 | 1 | |
| 5.355896444 × 10-5 | 1 | |
| 5.363795996 × 10-5 | 1 | |
| 5.371718886 × 10-5 | 1 | |
| 5.379665215 × 10-5 | 1 | |
| 5.38763509 × 10-5 | 1 | |
| 5.395628614 × 10-5 | 1 |
| Value | Count | Frequency (%) |
| 0.9311690383 | 1 | < 0.1% |
| 0.925874349 | 1 | < 0.1% |
| 0.9196972114 | 1 | < 0.1% |
| 0.9123969579 | 1 | < 0.1% |
| 0.9036366537 | 2 | |
| 0.8929296152 | 2 | |
| 0.8795458171 | 2 | |
| 0.8690911025 | 1 | < 0.1% |
| 0.8623380767 | 4 | |
| 0.8597404669 | 1 | < 0.1% |
okato
Real number (ℝ)
| Distinct | 460459 |
|---|---|
| Distinct (%) | 86.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.036500988 |
| Minimum | 7.2733074 × 10-5 |
|---|---|
| Maximum | 0.67878885 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 7.2733074 × 10-5 |
|---|---|
| 5-th percentile | 0.011484472 |
| Q1 | 0.022463894 |
| median | 0.030692252 |
| Q3 | 0.043769427 |
| 95-th percentile | 0.083431622 |
| Maximum | 0.67878885 |
| Range | 0.67871611 |
| Interquartile range (IQR) | 0.021305533 |
Descriptive statistics
| Standard deviation | 0.022692711 |
|---|---|
| Coefficient of variation (CV) | 0.62170128 |
| Kurtosis | 21.411154 |
| Mean | 0.036500988 |
| Median Absolute Deviation (MAD) | 0.010589581 |
| Skewness | 2.4236305 |
| Sum | 19448.639 |
| Variance | 0.00051495912 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.03636653686 | 81 | < 0.1% |
| 0.01818326843 | 73 | < 0.1% |
| 0.01212217895 | 69 | < 0.1% |
| 0.009091634214 | 67 | < 0.1% |
| 0.007273307371 | 65 | < 0.1% |
| 0.006061089476 | 62 | < 0.1% |
| 0.005195219551 | 61 | < 0.1% |
| 0.004545817107 | 59 | < 0.1% |
| 0.004040726317 | 57 | < 0.1% |
| 0.003636653686 | 55 | < 0.1% |
| Other values (460449) | 532176 |
| Value | Count | Frequency (%) |
| 7.273307371 × 10-5 | 1 | |
| 7.287883137 × 10-5 | 1 | |
| 7.302517441 × 10-5 | 1 | |
| 7.317210635 × 10-5 | 1 | |
| 7.331963076 × 10-5 | 1 | |
| 7.346775122 × 10-5 | 1 | |
| 7.361647137 × 10-5 | 1 | |
| 7.376579484 × 10-5 | 1 | |
| 7.391572532 × 10-5 | 1 | |
| 7.406626651 × 10-5 | 1 |
| Value | Count | Frequency (%) |
| 0.6787888456 | 1 | < 0.1% |
| 0.576623791 | 1 | < 0.1% |
| 0.5487605943 | 1 | < 0.1% |
| 0.5198654273 | 1 | < 0.1% |
| 0.5181832684 | 7 | |
| 0.5090916342 | 2 | < 0.1% |
| 0.5060610895 | 1 | < 0.1% |
| 0.5045458171 | 1 | < 0.1% |
| 0.5036366537 | 1 | < 0.1% |
| 0.5030305447 | 1 | < 0.1% |
gndr
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 343320 | |
| 1 | 189505 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 343320 | |
| 1 | 189505 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 343320 | |
| 1 | 189505 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 532825 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 343320 | |
| 1 | 189505 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 532825 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 343320 | |
| 1 | 189505 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 532825 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 343320 | |
| 1 | 189505 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 448437 | |
| 1 | 84388 | 15.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 448437 | |
| 1 | 84388 | 15.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 448437 | |
| 1 | 84388 | 15.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 532825 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 448437 | |
| 1 | 84388 | 15.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 532825 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 448437 | |
| 1 | 84388 | 15.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 532825 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 448437 | |
| 1 | 84388 | 15.8% |
accnt_bgn_date
Real number (ℝ)
High correlation 
| Distinct | 4070 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2432191 × 1018 |
| Minimum | 1.0933056 × 1018 |
|---|---|
| Maximum | 1.7005248 × 1018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1.0933056 × 1018 |
|---|---|
| 5-th percentile | 1.1291616 × 1018 |
| Q1 | 1.1647584 × 1018 |
| median | 1.2413952 × 1018 |
| Q3 | 1.308528 × 1018 |
| 95-th percentile | 1.384992 × 1018 |
| Maximum | 1.7005248 × 1018 |
| Range | 6.072192 × 1017 |
| Interquartile range (IQR) | 1.437696 × 1017 |
Descriptive statistics
| Standard deviation | 8.8614196 × 1016 |
|---|---|
| Coefficient of variation (CV) | 0.071278019 |
| Kurtosis | -0.68610523 |
| Mean | 1.2432191 × 1018 |
| Median Absolute Deviation (MAD) | 7.48224 × 1016 |
| Skewness | 0.30804657 |
| Sum | -4.3471845 × 1018 |
| Variance | 7.8524758 × 1033 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.0965024 × 1018 | 6070 | 1.1% |
| 1.2936672 × 1018 | 3788 | 0.7% |
| 1.1359008 × 1018 | 3671 | 0.7% |
| 1.1673504 × 1018 | 3590 | 0.7% |
| 1.096416 × 1018 | 3320 | 0.6% |
| 1.2937536 × 1018 | 2826 | 0.5% |
| 1.1352096 × 1018 | 2708 | 0.5% |
| 1.1343456 × 1018 | 2605 | 0.5% |
| 1.1988864 × 1018 | 2456 | 0.5% |
| 1.1527488 × 1018 | 2434 | 0.5% |
| Other values (4060) | 499357 |
| Value | Count | Frequency (%) |
| 1.0933056 × 1018 | 2 | < 0.1% |
| 1.093392 × 1018 | 4 | < 0.1% |
| 1.0934784 × 1018 | 39 | |
| 1.0935648 × 1018 | 5 | < 0.1% |
| 1.093824 × 1018 | 6 | < 0.1% |
| 1.0939104 × 1018 | 6 | < 0.1% |
| 1.0939968 × 1018 | 3 | < 0.1% |
| 1.0940832 × 1018 | 1 | < 0.1% |
| 1.0941696 × 1018 | 5 | < 0.1% |
| 1.0944288 × 1018 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.7005248 × 1018 | 1 | |
| 1.6971552 × 1018 | 1 | |
| 1.6680384 × 1018 | 1 | |
| 1.6657056 × 1018 | 1 | |
| 1.663632 × 1018 | 1 | |
| 1.6375392 × 1018 | 1 | |
| 1.6354656 × 1018 | 1 | |
| 1.6255296 × 1018 | 1 | |
| 1.6215552 × 1018 | 1 | |
| 1.621296 × 1018 | 1 |
pstl_code
Real number (ℝ)
| Distinct | 29036 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 458406.58 |
| Minimum | 0 |
|---|---|
| Maximum | 976974 |
| Zeros | 52 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 150051 |
| Q1 | 353380 |
| median | 443115 |
| Q3 | 627750 |
| 95-th percentile | 674500 |
| Maximum | 976974 |
| Range | 976974 |
| Interquartile range (IQR) | 274370 |
Descriptive statistics
| Standard deviation | 177726.58 |
|---|---|
| Coefficient of variation (CV) | 0.38770513 |
| Kurtosis | -1.0466921 |
| Mean | 458406.58 |
| Median Absolute Deviation (MAD) | 180156 |
| Skewness | -0.37630054 |
| Sum | 2.4425049 × 1011 |
| Variance | 3.1586738 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 162600 | 1278 | 0.2% |
| 623428 | 1128 | 0.2% |
| 624480 | 1089 | 0.2% |
| 398046 | 1077 | 0.2% |
| 162609 | 966 | 0.2% |
| 398036 | 954 | 0.2% |
| 429965 | 849 | 0.2% |
| 162626 | 842 | 0.2% |
| 670000 | 820 | 0.2% |
| 620050 | 733 | 0.1% |
| Other values (29026) | 523089 |
| Value | Count | Frequency (%) |
| 0 | 52 | |
| 23 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 454 | 1 | < 0.1% |
| 2600 | 1 | < 0.1% |
| 2624 | 1 | < 0.1% |
| 4531 | 1 | < 0.1% |
| 6025 | 1 | < 0.1% |
| 6323 | 1 | < 0.1% |
| 11396 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 976974 | 1 | |
| 964620 | 1 | |
| 962036 | 1 | |
| 943250 | 1 | |
| 925220 | 2 | |
| 924930 | 1 | |
| 920050 | 1 | |
| 906012 | 1 | |
| 882081 | 1 | |
| 851690 | 1 |
cprtn_prd_d
Real number (ℝ)
High correlation 
| Distinct | 5848 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 324.07322 |
| Minimum | 0 |
|---|---|
| Maximum | 7269 |
| Zeros | 551 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 87 |
| Q1 | 108 |
| median | 182 |
| Q3 | 326 |
| 95-th percentile | 771 |
| Maximum | 7269 |
| Range | 7269 |
| Interquartile range (IQR) | 218 |
Descriptive statistics
| Standard deviation | 581.01115 |
|---|---|
| Coefficient of variation (CV) | 1.7928392 |
| Kurtosis | 44.227609 |
| Mean | 324.07322 |
| Median Absolute Deviation (MAD) | 87 |
| Skewness | 6.1400166 |
| Sum | 1.7267431 × 108 |
| Variance | 337573.96 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 91 | 10411 | 2.0% |
| 90 | 7386 | 1.4% |
| 93 | 7273 | 1.4% |
| 94 | 6834 | 1.3% |
| 87 | 6731 | 1.3% |
| 95 | 6662 | 1.3% |
| 92 | 6525 | 1.2% |
| 88 | 6268 | 1.2% |
| 98 | 6145 | 1.2% |
| 99 | 5736 | 1.1% |
| Other values (5838) | 462854 |
| Value | Count | Frequency (%) |
| 0 | 551 | |
| 4 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 29 | 2 | < 0.1% |
| 32 | 5 | < 0.1% |
| 33 | 1 | < 0.1% |
| 35 | 1 | < 0.1% |
| 36 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 7269 | 1 | |
| 7260 | 1 | |
| 7252 | 1 | |
| 7249 | 1 | |
| 7247 | 1 | |
| 7238 | 2 | |
| 7236 | 1 | |
| 7231 | 1 | |
| 7219 | 1 | |
| 7215 | 1 |
prsnt_age
Real number (ℝ)
High correlation 
| Distinct | 62 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.142499 |
| Minimum | 37 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 37 |
|---|---|
| 5-th percentile | 59 |
| Q1 | 62 |
| median | 64 |
| Q3 | 66 |
| 95-th percentile | 70 |
| Maximum | 99 |
| Range | 62 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.6932793 |
|---|---|
| Coefficient of variation (CV) | 0.057579286 |
| Kurtosis | 1.4775444 |
| Mean | 64.142499 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.062734683 |
| Sum | 34176727 |
| Variance | 13.640312 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 64 | 70967 | |
| 65 | 65047 | |
| 66 | 58031 | |
| 67 | 47351 | |
| 63 | 40049 | |
| 62 | 39590 | |
| 61 | 38324 | |
| 60 | 36023 | |
| 59 | 34879 | |
| 68 | 24172 | 4.5% |
| Other values (52) | 78392 |
| Value | Count | Frequency (%) |
| 37 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 40 | 2 | < 0.1% |
| 41 | 2 | < 0.1% |
| 42 | 4 | < 0.1% |
| 43 | 2 | < 0.1% |
| 44 | 8 | < 0.1% |
| 45 | 15 | < 0.1% |
| 46 | 33 | |
| 47 | 74 |
| Value | Count | Frequency (%) |
| 99 | 2 | < 0.1% |
| 98 | 1 | < 0.1% |
| 97 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 95 | 2 | < 0.1% |
| 94 | 1 | < 0.1% |
| 93 | 4 | |
| 92 | 2 | < 0.1% |
| 91 | 8 | |
| 90 | 6 |
pnsn_age
Real number (ℝ)
High correlation 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57.073424 |
| Minimum | 55 |
|---|---|
| Maximum | 65 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 55 |
|---|---|
| 5-th percentile | 55 |
| Q1 | 55 |
| median | 55 |
| Q3 | 60 |
| 95-th percentile | 61 |
| Maximum | 65 |
| Range | 10 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.5669216 |
|---|---|
| Coefficient of variation (CV) | 0.044975778 |
| Kurtosis | -1.0745384 |
| Mean | 57.073424 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.64548734 |
| Sum | 30410147 |
| Variance | 6.5890867 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 55 | 294227 | |
| 60 | 162647 | |
| 56 | 34852 | 6.5% |
| 61 | 31149 | 5.8% |
| 65 | 4923 | 0.9% |
| 59 | 4765 | 0.9% |
| 58 | 130 | < 0.1% |
| 63 | 77 | < 0.1% |
| 64 | 55 | < 0.1% |
| Value | Count | Frequency (%) |
| 55 | 294227 | |
| 56 | 34852 | 6.5% |
| 58 | 130 | < 0.1% |
| 59 | 4765 | 0.9% |
| 60 | 162647 | |
| 61 | 31149 | 5.8% |
| 63 | 77 | < 0.1% |
| 64 | 55 | < 0.1% |
| 65 | 4923 | 0.9% |
| Value | Count | Frequency (%) |
| 65 | 4923 | 0.9% |
| 64 | 55 | < 0.1% |
| 63 | 77 | < 0.1% |
| 61 | 31149 | 5.8% |
| 60 | 162647 | |
| 59 | 4765 | 0.9% |
| 58 | 130 | < 0.1% |
| 56 | 34852 | 6.5% |
| 55 | 294227 |
accnt_bgn_year
Real number (ℝ)
High correlation 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2008.713 |
| Minimum | 2004 |
|---|---|
| Maximum | 2023 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 2004 |
|---|---|
| 5-th percentile | 2005 |
| Q1 | 2006 |
| median | 2009 |
| Q3 | 2011 |
| 95-th percentile | 2013 |
| Maximum | 2023 |
| Range | 19 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.8627334 |
|---|---|
| Coefficient of variation (CV) | 0.001425158 |
| Kurtosis | -0.77361452 |
| Mean | 2008.713 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.25320765 |
| Sum | 1.0702925 × 109 |
| Variance | 8.1952426 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2006 | 75564 | |
| 2010 | 71690 | |
| 2005 | 64622 | |
| 2011 | 59284 | |
| 2007 | 55495 | |
| 2009 | 46483 | |
| 2013 | 44962 | |
| 2008 | 41979 | |
| 2012 | 40186 | |
| 2004 | 18691 | 3.5% |
| Other values (10) | 13869 | 2.6% |
| Value | Count | Frequency (%) |
| 2004 | 18691 | 3.5% |
| 2005 | 64622 | |
| 2006 | 75564 | |
| 2007 | 55495 | |
| 2008 | 41979 | |
| 2009 | 46483 | |
| 2010 | 71690 | |
| 2011 | 59284 | |
| 2012 | 40186 | |
| 2013 | 44962 |
| Value | Count | Frequency (%) |
| 2023 | 2 | < 0.1% |
| 2022 | 3 | < 0.1% |
| 2021 | 13 | < 0.1% |
| 2020 | 30 | < 0.1% |
| 2019 | 46 | < 0.1% |
| 2018 | 622 | 0.1% |
| 2017 | 1233 | 0.2% |
| 2016 | 2775 | |
| 2015 | 6405 | |
| 2014 | 2740 |
erly_pnsn_flg
Categorical
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 MiB |
| 0 | |
|---|---|
| 1 | 19377 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 513448 | |
| 1 | 19377 | 3.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 513448 | |
| 1 | 19377 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 513448 | |
| 1 | 19377 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 532825 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 513448 | |
| 1 | 19377 | 3.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 532825 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 513448 | |
| 1 | 19377 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 532825 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 513448 | |
| 1 | 19377 | 3.6% |
Interactions
Correlations
| accnt_bgn_date | accnt_bgn_year | addrss_type | brth_plc | cprtn_prd_d | erly_pnsn_flg | gndr | lk | location | okato | pnsn_age | prsnt_age | prvs_npf | pstl_code | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| accnt_bgn_date | 1.000 | 0.994 | 0.256 | 0.090 | 0.347 | 0.485 | 0.173 | 0.236 | 0.059 | 0.019 | -0.076 | -0.253 | 0.095 | 0.023 |
| accnt_bgn_year | 0.994 | 1.000 | 0.257 | 0.089 | 0.419 | 0.483 | 0.174 | 0.237 | 0.058 | 0.017 | -0.076 | -0.252 | 0.094 | 0.030 |
| addrss_type | 0.256 | 0.257 | 1.000 | 0.055 | 0.069 | 0.068 | 0.103 | 0.076 | 0.026 | 0.019 | -0.041 | -0.097 | 0.312 | -0.012 |
| brth_plc | 0.090 | 0.089 | 0.055 | 1.000 | 0.036 | 0.140 | 0.022 | 0.070 | 0.122 | 0.090 | 0.007 | -0.069 | 0.057 | -0.030 |
| cprtn_prd_d | 0.347 | 0.419 | 0.069 | 0.036 | 1.000 | 0.806 | 0.057 | 0.247 | 0.020 | -0.002 | 0.036 | -0.180 | 0.064 | 0.047 |
| erly_pnsn_flg | 0.485 | 0.483 | 0.068 | 0.140 | 0.806 | 1.000 | 0.037 | 0.250 | 0.157 | 0.092 | 0.721 | 0.662 | 0.185 | 0.066 |
| gndr | 0.173 | 0.174 | 0.103 | 0.022 | 0.057 | 0.037 | 1.000 | 0.062 | 0.018 | 0.025 | 0.963 | 0.498 | 0.071 | 0.024 |
| lk | 0.236 | 0.237 | 0.076 | 0.070 | 0.247 | 0.250 | 0.062 | 1.000 | 0.082 | 0.048 | 0.180 | 0.248 | 0.116 | 0.022 |
| location | 0.059 | 0.058 | 0.026 | 0.122 | 0.020 | 0.157 | 0.018 | 0.082 | 1.000 | 0.457 | -0.005 | -0.068 | 0.061 | -0.057 |
| okato | 0.019 | 0.017 | 0.019 | 0.090 | -0.002 | 0.092 | 0.025 | 0.048 | 0.457 | 1.000 | 0.005 | -0.043 | 0.046 | -0.012 |
| pnsn_age | -0.076 | -0.076 | -0.041 | 0.007 | 0.036 | 0.721 | 0.963 | 0.180 | -0.005 | 0.005 | 1.000 | 0.320 | -0.020 | 0.010 |
| prsnt_age | -0.253 | -0.252 | -0.097 | -0.069 | -0.180 | 0.662 | 0.498 | 0.248 | -0.068 | -0.043 | 0.320 | 1.000 | -0.087 | 0.033 |
| prvs_npf | 0.095 | 0.094 | 0.312 | 0.057 | 0.064 | 0.185 | 0.071 | 0.116 | 0.061 | 0.046 | -0.020 | -0.087 | 1.000 | -0.068 |
| pstl_code | 0.023 | 0.030 | -0.012 | -0.030 | 0.047 | 0.066 | 0.024 | 0.022 | -0.057 | -0.012 | 0.010 | 0.033 | -0.068 | 1.000 |
Missing values
Sample
| location | addrss_type | prvs_npf | brth_plc | okato | gndr | lk | accnt_bgn_date | pstl_code | cprtn_prd_d | prsnt_age | pnsn_age | accnt_bgn_year | erly_pnsn_flg | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.036367 | 0.036367 | 0.036367 | 0.036367 | 0.036367 | 0 | 0 | 1135123200000000000 | 644001.0 | 96 | 64 | 55 | 2005 | 0 |
| 1 | 0.036367 | 0.018183 | 0.018183 | 0.036367 | 0.036367 | 1 | 0 | 1246233600000000000 | 676852.0 | 283 | 70 | 60 | 2009 | 0 |
| 2 | 0.036367 | 0.012122 | 0.012122 | 0.036367 | 0.036367 | 1 | 0 | 1167004800000000000 | 109451.0 | 88 | 69 | 60 | 2006 | 0 |
| 3 | 0.036367 | 0.009092 | 0.036367 | 0.036367 | 0.036367 | 0 | 0 | 1378166400000000000 | 423464.0 | 1301 | 62 | 55 | 2013 | 0 |
| 4 | 0.036367 | 0.036367 | 0.009092 | 0.036367 | 0.036367 | 1 | 0 | 1291593600000000000 | 427415.0 | 106 | 69 | 60 | 2010 | 0 |
| 5 | 0.036367 | 0.007273 | 0.007273 | 0.036367 | 0.036367 | 0 | 0 | 1354233600000000000 | 623415.0 | 116 | 68 | 55 | 2012 | 0 |
| 6 | 0.036367 | 0.006061 | 0.006061 | 0.036367 | 0.036367 | 0 | 1 | 1311033600000000000 | 636500.0 | 253 | 71 | 55 | 2011 | 0 |
| 7 | 0.036367 | 0.005195 | 0.005195 | 0.036367 | 0.036367 | 0 | 0 | 1155254400000000000 | 633542.0 | 224 | 60 | 55 | 2006 | 0 |
| 8 | 0.036367 | 0.018183 | 0.018183 | 0.036367 | 0.036367 | 0 | 0 | 1239753600000000000 | 453130.0 | 358 | 62 | 55 | 2009 | 0 |
| 9 | 0.036367 | 0.004546 | 0.004546 | 0.036367 | 0.018183 | 0 | 0 | 1196035200000000000 | 624480.0 | 123 | 63 | 55 | 2007 | 0 |
| location | addrss_type | prvs_npf | brth_plc | okato | gndr | lk | accnt_bgn_date | pstl_code | cprtn_prd_d | prsnt_age | pnsn_age | accnt_bgn_year | erly_pnsn_flg | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 532815 | 0.007899 | 0.030452 | 0.025518 | 0.036367 | 0.026559 | 0 | 0 | 1135036800000000000 | 403876.0 | 97 | 64 | 55 | 2005 | 0 |
| 532816 | 0.025966 | 0.030452 | 0.040485 | 0.001102 | 0.029911 | 0 | 0 | 1385769600000000000 | 420078.0 | 544 | 62 | 55 | 2013 | 0 |
| 532817 | 0.003482 | 0.030452 | 0.025518 | 0.000296 | 0.027301 | 0 | 0 | 1142553600000000000 | 457351.0 | 371 | 67 | 55 | 2006 | 0 |
| 532818 | 0.007008 | 0.030452 | 0.025518 | 0.014005 | 0.025558 | 0 | 0 | 1132012800000000000 | 632332.0 | 132 | 60 | 55 | 2005 | 0 |
| 532819 | 0.006995 | 0.030451 | 0.025518 | 0.005195 | 0.025556 | 1 | 0 | 1150934400000000000 | 632332.0 | 274 | 68 | 60 | 2006 | 0 |
| 532820 | 0.038228 | 0.030451 | 0.025518 | 0.049382 | 0.037016 | 0 | 0 | 1387756800000000000 | 603070.0 | 490 | 59 | 56 | 2013 | 0 |
| 532821 | 0.037200 | 0.030451 | 0.025518 | 0.005195 | 0.046430 | 1 | 0 | 1211932800000000000 | 185030.0 | 307 | 68 | 60 | 2008 | 0 |
| 532822 | 0.030481 | 0.030451 | 0.025517 | 0.036367 | 0.039656 | 1 | 0 | 1292371200000000000 | 452155.0 | 97 | 65 | 60 | 2010 | 0 |
| 532823 | 0.002744 | 0.030451 | 0.003306 | 0.036367 | 0.010676 | 0 | 0 | 1288742400000000000 | 393761.0 | 139 | 65 | 55 | 2010 | 0 |
| 532824 | 0.053344 | 0.030451 | 0.064511 | 0.045133 | 0.056457 | 1 | 0 | 1283126400000000000 | 660064.0 | 204 | 65 | 60 | 2010 | 0 |